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PRINCIPAL  INVESTIGATOR:  Niels  H.  Andersen,  Professor  of  Chemistry 
CONTRACTOR :  University  of  Washington 

CONTRACT  TITLE:  Magnetic  Resonance  Elucidation  of  Molecule /Macronolecule 
Interaction  Stereochemistry 

CONTRACT  PERIOD:  88FEB01  to  91JAN31 

RESEARCH  OBJECTIVES :  To  develop  and  experimentally  verify  data  collection 
and  reduction  protocols  suitable  for  the  definition  of  molecule  structure  in: 
1)  the  free  solution  state,  2)  the  membrane-lipid  associated  state,  and  3) 
protein-bound  states  that  represent  or  mimic  the  steps  of  molecular 
recognition  and  receptor  interaction.  The  specific  emphasis  will  be  on 
computer-aided  conformation  elucidation. 

The  anticipated  products  of  this  effort  will  be  computer  programs  that 
facilitate  NOESY-based  conformation  elucidation  and  are  applicable  to  data 
collected  for  small  molecules  in  various  states  including  dynamic  equilibria 
that  include  "receptor-bound"  states.  Application  of  these  programs  will  be 
illustrated  with  rigid  models  systems,  more  flexible  receptor  probes  and 
peptides  and,  if  available,  small  protein  and  DNA  systems. 


PROGRESS  (Year  02  +  3/90) : 

Program  NOESYSIM  has  been  revised  to  include  the  more  theoretically 
correct  Tropp  and  Woessner1  treatments  of  methyl  group  cross-relaxation  and 
internal  relaxation  respectively.  The  final  bugs  should  be  removed  in  all 
software  in  use  or  distributed  after  5/90. 

DISCON,  our  program  for  extracting  distances  (fully  corrected  for  spin 
diffusion  artefacts)  was  announced  at  the  30th  ENC  in  April  1989. 2  In  that 
presentation  we  used  an  experimental  data  set  for  f luoromethalone  (in  viscous 
media  such  that  tc  ;  1  vis)  as  a  test.  We  have  now  compared  four  different 
data  set3  corresponding  to  this  motional  regime.  Distances  out  to  4.0  A  were 
measurable  in  our  experiments.  The  rms  error  of  distance  estimation  for 
individual  experiments  ranged  for  0.33  -  0.49  A  (distances  involving  methyl 
groups  were  not  used  for  this  comparison  due  to  the  additional  geometry 
factors  that  confluence  c^^) .  If  the  "distances"  from  each  experiment  are 
viewed  as  independent  obervations,  precision  only  slightly  exceeds  accuracy. 
In  fact,  the  averages  of  experimental  distances  from  these  four  NOESY  data 
sets  display  much  greater  accuracy:  rms  error  of  averages,  0.23  A  (->  3.4A), 
0.30  A  (-»  4.0  A).  We  have  concluded  that  much  of  the  error  is  associated 
with  the  peak  volume  integration  routine  we  used  to  extract  the  signal 
intensities . 

Recently  we  collected  one  additional  data  set  for  f luoromethalone  paying 
particular  attention  to  peak  quantitation  techniques.  The  data  was  for  a 
non-viscous  solvent  in  which  this  steroid  displayed  rc  -  210ps.  As  usual  for 
small  molecules,  the  detection  limit  corresponded  to  3.4  A  rather  than  4 A. 
However  the  rms  error  was  much  improved  (0.13A). 
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Four  NOESY  data  sets  for  lysobactin,  a  rigid  backbone  undecapeptide 
macrolide,  have  been  subjected  to  DISCON  analysis  to  derive  a  new  set  of 
distance  constraints.  (We  had  previously  derived  constraints  from  one  of  the 
data  sets  using  a  modified  ISPA3} .  A  total  of  66  constraints  for  refinement 
(using  CHARMm  dynamics  and  minimization)  were  selected  from  the  DISCON 
output.  These  included  30  intraresidue,  23  sequential,  and  13  long-range 
constraints  with  the  following  bounds  assigned  from  the  DISCON  error  analy¬ 
sis  . 


distance  range  #  of  constraints  ave  bounds  range (A) 

2.0  -  2.7  A  21  0.23 

2.71-  3.1  A  23  0.29 

3.15-  4.4  A  22  0.54 

The  best  fitting  structures  from  the  refinement  displayed  only  two  bound 
violations  exceeding  0.4  A.  The  typical  values  of  £  (violations)  was  1.1-1. 6 
A,  quite  remarkable  for  a  set  of  66  constraints  with  an  average  experimental 
error  of  only  *0.18  A.  The  resulting  "structure"  also  is  of  high  precision; 
the  backbone  rmsd  for  the  3  best  fitting  structures  in  the  set  was  0.29  A. 

We  believe  these  examples  fully  substantiate  our  claim  that  DISCON  will 
provide  experimental  distances  of  sufficient  accuracy  for  high  precision 
conformational  analysis. 

One  further  test  of  DISCON  deserves  mention.  It  i3  based  on  simulated 
data  and  is  our  first  foray  into  the  area  of  DNA  structural  analysis.  Brian 
Reid  and  K.  Banks  of  this  department  had  previously  used  a  regularized  B-DNA 
model  of  a  self -complementary  dodecamer  [d(CGCGAATTCGCG) ] 2  to  generate  simu¬ 
lated  spectra*  for  a  test  of  a  refinement  strategy*  that  can  be  outlined  as: 
NOESYs  (at  several  rm)  initial  bounds  (from  a  linear  growth  ISPA 
analysis)  £°SPACE-.  model  #n  BKCALC_  compare  simulated  NOESY  to  "experiment"  and 
adjust  bounds^  DSPACE_  Refined  Model.  The  Reid/Banks  procedure  used  rm  «  30, 
60,  90,  150  ms  simulated  data  sets  to  estimate  observable  NOE  build-up  rates. 
We  applied  model-free  DISCON  analysis  to  the  60  and  150  ms  data  (with  no 
noise)  and  a  100  ms  rm  simulation  which  included  resonable  levels  of  noise. 
The  rms  deviations  (relative  to  B-Model,  the  coordinates  that  generated  the 
simulated  data)  of  distance  constraints  by  these  methods  are  summarized 
below. 


r.m.s.  deviation  (A) 


B-MODEL 

initial  rate 

final  bounds 

100  ms 

r<j  range  (A) 

ISPA  bounds 

4  BKCALC  cycles 

60  ms 

150  ms 

noise 

1.91-2.3 

0.09 

0.12 

0.01 

0.04 

0.02 

2.31-3.1 

0.11 

0.09 

0.02 

0.08 

0.06 

3.11-3.8 

0.33 

0.07 

0.03 

0.17 

0.20 

3.81-4.7 

0.77 

0.22 

0.17 

0.21 

0.41 

+  NOESYSIM  reproduces  the 
(relative)  when  we  use  r 

semi -empirical  BKCALC 
c  «  3.5  ns,  p*  -  0.19 

simulation 

s'1. 

to  better 

than  *3% 
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Experimental  Reaulta . 

Further  studies  of  the  binding  of  Trp  and  6-F-Trp  enantiomers  to  albumin 
have  confirmed  that  the  off-rates  are  such  as  to  complicate  the  derivation  of 
a  specific-receptor  bound  conformation.  Additional  studies  are  proposed. 

We  recently  worked  out  procedures  for  obtaining  NOESY  data  (including 
backbone  NH  connectivities)  for  yeast  a-factor  (a  13mer  pheromone)  and 
[Arg8] -a-neoendorphin  (1-8) ,  a  neuropeptide, in  micelle-associated  states. 

Other  workers  have  had  to  relie  on  spectra  in  D20  which  thus  lack  the 
structure  diagnostic  NH  interactions.  A  cross-section  of  the  NOESY  of  the 
latter  neuropeptide,  showing  the  cross-relaxation  spectrum  of  one  of  its 
backbone  Gly-NH  resonances,  appears  below. 


Clearly  we  will  be  able  to  obtain  the  required  matrix  of  data  necessary  for 
structure  elucidation.  The  yeast  a-f actor  data  shows  even  greater  S/N  and 
resolution . 

Endothelins,  members  of  a  new  class  of  peptide  vasoconstrictors,  have 
been  the  other  major  arena  for  data  collection.  Three  representative 
structures  are  shown  below 


*  I  6  8  ''  '  17  18 

ET-1  CSCSSLMDKKCVYFCHL  D  XXW 


KT-2  :  W  L 


rr-3 

ET-2  is  perhaps  the  most  potent  endogenous  vasoconstrictor  known.  As 
21-residue  bisdisulf ides, they  should  be  ammena.  >.  .o  conformation  elucidation 
by  2D  NMR  using  DISCON  and  restrained  dynamics  a rr.* -'ling. 

We  have  now  collected  NOESY  data  for  native  E'i-1  endothelin  in  a  variety 
of  water/glycol  mixtures  at  295  and  309  K  for  both  the  protonated  and 
neutral  His16  forms  in  both  protonic  and  deuteronic  media.  The  spectra  have 
been  completely  assigned  at  both  pHs  and  at  glycol  concentrations  from  20-ft0% 
(v/v) .  At  60%  glycol  the  temperature  gradients  for  backbone  amide-NHs  span  a 
wide  range  (-11.4  -•  +2.7  ppt/  C)  and  thus  NOESY  spectra  at  different  tempera¬ 
tures  provide  different  unambiguous  cross-peak  intensi-  ties.  Figure  SI 
illustrates  this  by  partially  annotated  NH/a,  /fi-H  region  NOESY  for  the  acid 
form  of  ET-1. 


3 


Quantitative  analysis  and  refinement  of  endothelin  data  sets  is  in  progress, 
but  a  number  of  structural  features  are  obvious  from  a  qualitative  analysis: 
1)  residues  9  -•  15  form  a  distorted  helix,  2)  segmental  motion  occurs  in  the 
backbone  for  at  least  a  portion  of  the  1  -»  4  region  and  also  probably  at  7, 

8,  3)  a  relatively  persistent  turn  occurs  in  the  vicinity  of  Ser5,  and  4)  the 
C-terminal  tail  is  not  a  random  coil  as  might  be  expected. 


WORK  PLAN  (year  03) : 

Software  Development.  We  plan  to  develop  automated  routines  for  conver¬ 
ting  DISCON  output  into  constraints  suitable  for  direct  use  in  XPLOR, s  the 
present  gold-standard  of  restrained  dynamics  annealing  programs  for  structure 
refinement  based  on  NOESY  data.  At  present6  this  program  is  typically  used 
with  very  wide  bounds  categories  such  as:  1.8-2. 7,  2. 0-3. 5,  2.3-5  and  2. 8-6 A. 

We  expect  DISCON  constraint  bounds  of  ±0.2  A  to  3. 4  A,  -  ±0.4  A  out  to  4*  A  should 
yield  much  more  precise  structures;  but  the  best  means  for  incorporating  such 
high  precision  constraints  into  XPLOR  is  problematic.  We  will  use  lysobactin 
as  the  primary  test  system  for  this  effort,  but  the  cyclic  hexapeptide 
somatostatin  analogs  will  also  serve  in  this  capacity  since  their  structural 
variability  has  been  better  documented  in  other  laboratories. 

Experimental  Work  and  Specific  Structure  Refinements 

We  will  continue  our  refinement  studies  for  endothelin  and  for  yeast 
a-f actor  and  [Arg8] -or-neoendorphin  in  micelles.  The  latter  study  will  be 
extended  to  a  13mer  neuropeptide,  dynorphin.  We  expect  to  use  the  DISCON- 
XLPOR  combination  in  the  final  stages  of  structure  refinement  in  all  cases. 

As  previously  noted,  we  will  continue  to  explore  the  binding  geometry 
associated  with  the  specific  binding  site  of  serum  albumin  for  tryptophan. 

We  will  evaluate  the  displacement  kinetics  of  Ac-G-H-K-W  using  19F-NMR  with 
L-6-f luorotryptophan  as  the  specific  binding  probe.  Once  we  have  found 
conditions  with  koff  z  50  s-'1  for  the  tetrapeptide,  we  will  use  exchange 
transferred  NOESY  to  derive  its  receptor-bound  conformation. 


Somatostatin  Analogs,  rigid  systems  for  testing  DISCON 

Dr.  Freidinger  (Merck)  has  kindly  agreed  to  provide  us  with  two  remarkably 
potent  somatostatin  analogs  (L363301  and  L363586,  below) : 


,67  8 

NMe-Ala  -Tyr  -D-Trp 

'll  ,10  1  9 

Phe  -Val  -Lys 

1363586 


6  7  8 

Pro  -Phe  -D-Trp 

•  ll  t  10  1  9 

Phe  -Thr  -Lys 


D-Pro  -Phe  -Trp 


'  ll  .  10  1  9 

Phe  -Thr  -Lys 


L363301 


This  class  of  cyclic  hexapeptides  has  been  extensivly  studied.  Kessler  et  al.7 
applied  every  known  2D  NMR  method  (including  E-COSY,  HETCORs,  and  a  quantitative 
ROESY)  to  analog  1  and  derived  a  detailed  solution  structure  and  compared  it  to 
the  high  resolution  x-ray  structure  of  the  Lys  Phe  substitution  analog. 


Our  studies  will  focus  on  L363586  and  363301  in  DMSO  containing  10-40%  H20. 
This  cryogenic  media  should  allow  us  to  perform  NOESY  spectra  at  sufficiently  low 
temperature  so  that  urc  >  3.  These  results  will  be  compared  to  a  room 
temperature  ROESY.  We  will  attempt  to  measure  every  3-bond  JCH  and  JHH  for  the 
molecule  and  use  amide-NH  A8/AT  gradients  to  determine  which  backbone  NHs  are 
involved  in  persistent  H-bonds . 
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Distant  constraints  derived  by  ISPA  (a  linear  growth  approximation) 3  and 
DISCON  will  be  compared  to  those  in  the  final  refined  structure  and  to  an 
idealized  structure  bearing  the  #/♦  values  observed  in  the  crystal  structures  of 
other  analogs . 


Extension  to  Proteins 

Finally  we  will  explore  the  application  of  our  methods  to  a  small  protein, 
hevein,  which  was  isolated  from  a  rubber  tree  species.8  Its  structure  is  given 
below 


E  Q 

21 

W  6 


WCGSTDEYCSPDHNCQSNC 


K  D 


It  represents  a  significant  challenge  even  though  it  is  only  a  43mer  due  to 
the  high  frequency  of  repeated  or  similar  residues.  Preliminary  control 
spectra  indicate  that  suitable  high  resolution  data  will  be  obtainable.  Dr. 
Arreguin  (Instituto  de  Quimica,  Ciudad  Universitaria  Coyoacan,  Mexico  DF)  is 
pursuing  an  independent  x-ray  crystallographic  structure  determination  which 
should  serve  as  a  check  on  the  NMR  method.  We  have  instructed  our  Mexican 
collaborators  not  to  provide  as  any  crystallographic  structural  information 
so  that  the  method  can  be  totally  independent . 

The  hevein  studies  may  be  included  in  an  upcoming  NIH  grant  application 
and  in  the  renewal  request  for  this  contract . 
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